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We explore possibility of using the three dimensional bispectra of the Ly-a? forest and the redshifted 
21-cm signal from the post-reionization epoch to constrain primordial non-Gaussianity. Both these 
^ , fields map out the large scale distribution of neutral hydrogen and maybe treated as tracers of 

■ the underlying dark matter field. We first present the general formalism for the auto and cross 

bispectrum of two arbitrary three dimensional biased tracers and then apply it to the specific case. 
We have modeled the 3D Ly-a transmitted flux field as a continuous tracer sampled along ID skewers 
which corresponds to quasars sight lines. For the post reionization 21-cm signal we have used a linear 
" bias model. We use a Fisher matrix analysis to present the first prediction for bounds on / NL and 

the other bias parameters using the three dimensional 21-cm bispectrum and other cross bispectra. 
The bounds on / NL depend on the survey volume, and the various observational noises. We have 
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considered a BOSS like Ly-a survey where the average number density of quasars n = 10 -3 Mpc" 



and the spectra are measured at a 2-<j level. For the 21-cm signal we have considered a 4000 hrs 
observation with a futuristic SKA like radio array. We find that bounds on /nl obtained in our 
analysis (6 < A/nl < 65) is competitive with CMBR and galaxy surveys and may prove to be 
an important alternative approach towards constraining primordial physics using future data sets. 
C*| . Further, we have presented a hierarchy of power of the bispectrum-estimators towards detecting 

O f the / NL . Given the quality of the data sets, one may use this method to optimally choose the 

right estimator and thereby provide better constraints on /nl- This shall be important in the quest 
O , towards understanding the mechanism behind the generation of primordial perturbations. 
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& ■ PACS numbers: 98.80.Cq, 98.62.Ra 
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INTRODUCTION 

> 

_ In the study of inflationary cosmology, probing the deviations from Gaussian initial conditions offer valuable insights 
£• — ■ to our understanding of the mechanisms that generated primordial fluctuations. Whereas, departures from non- 
Gaussianity is very small for the standard single inflaton models [1], a wide class of theories predict a moderate to 
high non-Gaussianity [2]. Measuring the degree of primordial non-Gaussianity hence, directly allows us to narrow 
down the range of viable inflation models. The bispectrum or the three point correlation function of the density 
fluctuations is a standard quantifier of non-Gaussianity and the bispectrum of the CMBR temperature anisotropics 
and large scale structure have been extensively studied in this regard [3, 4]. A recent work has also proposed the use 
of three dimensional Ly-a forest to measure primordial non-Gaussianity [5]. The bispectrum of any good tracer of 
the underlying density field is, however a potential probe of primordial non-Gaussianity. 

The neutral hydrogen (HI) distribution in the the post reionization IGM is a powerful cosmological probe of the 
low rcdshift universe. The complex astrophysical processes that dictate the HI distribution during the epoch of 
reionization, becomes largely irrelevant at redshifts z < 6. Here, two astrophysical systems are of observational 
interest. The dense and self shielded Damped Lyman a (DLA) systems [6] house bulk of the HI and is the dominant 
source of the 21-cm signal [7] (seen in emission). HI density fluctuations in the predominantly ionized IGM, on the 
contrary is responsible for the distinct absorption features - the Ly-a forest [8], in the spectrum of back ground 
quasars. On large cosmological scales both the Ly-a forest and the redshifted 21-cm signal are believed to be biased 
tracers of the underlying dark matter distribution. 

The possibility of measuring the cross correlation power spectrum of the 21-cm signal and Ly-a forest has been 
proposed [9] as a means to bypass some of the observational issues. The two signals being tracers of the underlying 
large scale structure are expected to be correlated on large scales. However, foregrounds and other systematics are 
believed to be uncorrelated between the two independent observations. Hence, the cross correlation power spectrum 
signal if detected will clearly ascertain its cosmological origin. It is only natural to generalize the notion of cross 
power spectrum to cross-bispectrum of the two fields. The cross-bispectrum is an obvious byproduct of the two data 
sets that are used to measure the auto correlations. Further, an advantage of the cross-correlation study is that, in 
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it, the demerits of one of the data sets gets partly compensated by the merits of the other. 

The auto correlations of both the 21-cm maps [10] and the Ly-a forest [5] towards a measurement of primordial 
non-Gaussianity has been independently studied. In this paper we investigate the three dimensional cross-bispectrum 
of these cosmological fields. We set up the general formalism for calculating the cross-bispectrum of two arbitrary 
low redshift tracers of the underlying matter distribution. The general formalism is then applied to estimate the 
cross-bispectrum of 21-cm brightness temperature and Ly-a transmitted flux fields. We use a Fisher matrix analysis 
to investigate the possibility of constraining the non-Gaussianity parameter /nl from the cross-bispectrum signal for 
a range of observational parameters. A comparison between the various auto and cross bispectra is also presented. 
We note that this is the first direct investigation of primordial non-Gaussianity using bispectrum of the entire three 
dimensional information contained in the distribution of low redshift neutral hydrogen. 



FORMALISM 



On sub-horizon scales the matter overdensity field At in Fourier space is related to the primordial gravitational 
potential as A^(z) = Ai(k, z)<I>£ llm = — | ^ T ^ Z3 + (z)$^ r ' m , where T(k) denotes the matter transfer function and 
D + (z) is the growing mode of density fluctuations. The statistical properties of A^ are quantified through the 
n— point correlations. The first two non-trivial of these are the power spectrum and bispectrum, defined respectively 
as (AkjAj.;,} = Sjj(ki + k 2 )P(k 1 ) and (A^ Ak 2 Ak 3 ) = <5o(ki + k2 + k3)_B(fci, k 2 , k 3 ). The primordial gravitational 
potential is written as $P rml = G + £^(<j) G — (4>' G ))i where 4>g is a Gaussian random field and non-Gaussianity is 
quantified using a single parameter /nl- The linear power spectrum of a sufficiently smoothed density field is given 
by P L (k) = k fp h± -P+( z )-P<]>" m ; where P$ llm denotes the primordial power spectrum of the gravitational potential 

such that P$ 1>m = P$g +0(/j L )- The power spectrum P$g arising from the Gaussian field 4>g does not exhibit exotic 
features and is scale invariant. In a linear theory the bispectrum arising from non-Gaussianity in the primordial 
matter field is given by B L i23 = M(k 1 )M(k 2 )M(k 3 )B ! pG 1 23 where the notation 123 = (hi, k 2 , fc 3 ) and we have used 
the form of B^,Gi 23 from earlier works [5, 11]. 

The measurement of n— point functions of the matter density field is usually implicit. One actually measures the 
statistical properties of the low redshift biased tracers. At lower redshifts one needs to incorporate non-linear structure 
formation caused by gravitational instability. This induces additional non-Gaussianity in addition to the contribution 
from primordial sources. The additional contribution to the matter bispectrum, B>f 2 \ [5, 11] is obtained from the 
second order perturbation theory. The total matter bispectrum is hence given by the sum -B123 = -B123 + -B123 , where 
a possible contribution from the primordial trispectrum has been ignored. 

Let us consider tracer fields denoted by R and S which are related to the underlying matter overdensity field as 

6 (2) 

R(k) [S(k)] = ^A(k) + ^A(k) 2 . (1) 

We are interested in the statistical properties of these tracers. The cross-correlation power spectrum Prs for the 
tracer fields and bispectra Errs, in the leading order are related to the matter power spectrum and the bispectrum 

as, 



P RS (k) = b£bg>P(k) 



B RSS = " H I a [B 123 + £?23i + B312] + ^ 24 1) 4 1) 4 2) 0P(fc 1 )P(fc 2 ) + cyc.) 



+ ^ 2) 4 1) 4 1) ( J P(fci) J P(fe) + cyc)j . (2) 

We note that for R = S we have the auto-correlation power spectrum and bispectrum. Following the formalism in 
[11] we define the generalized bispectra estimators B t with e = R, RSS as 



Be = [ d 3 qi f d 3 q 2 / d 3 q 3 ^(qi2 3 ) x U e (k u k 2 ,k 3 ). 

Vl23 Jk ± Jk 2 Jk a 



(3) 



Here qi 23 = qi + q2 + q3, Vf = (2ir) 3 /V where V is the survey volume, V 123 = J ki d 3 qi J fa <i 3 q 2 J k3 d 3 q 3 ^D(qi23) 
and the integrals are performed over the qi— intervals (ki — 4^-, ki + The integrands are given by Il/jsg = 

(l/3)[A^j(fci)A|.(A:2)A^(fc3)+cyc.]. The 'observed' fields /S° s (ki) or A R (ki) in Fourier space contain noise contributions 
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and are in general different from the respective cosmological fields As(ki) and An(k{). The estimators defined above 

-2 -2 - 

are unbiased and their variances can be calculated using the relation AB t = (B € ) — (B e ) . In the leading order this 
yields the following 

AB RSS 2 = [t (P£ ot (fci)P s Tot (fc 2 )P s Tot (fc3) + eye.) + 2t (P^ik^ik^P^ih) + eye.)] . (4) 

y ^123 

As earlier, R — S gives the variance for the auto correlation bispectrum. The combinatorial factor t = 6, 1 for 
equilateral and scalene triangles respectively and P Tot includes any additional noise power spectra along with the 
cosmological power spectrum. Armed with the results in equation (2) and equation (4), we shall apply the general 
formalism to the two tracer fields of interest namely the Ly-a forest spectra and redshifted 21-cm signal, both of 
which are related to the neutral hydrogen distribution. 

Observations indicate that, following the epoch of reionization, bulk of the neutral gas is contained in the self 
shielded DLA systems [6]. The collective 21-cm emission from these clouds is known to form a diffused background 
in low frequency radio observations. We use St to denote the fluctuations in the brightness temperature of redshifted 
21-cm radiation. We are interested in the redshift range z < 3.5 whereby it is reasonable to assume that 5t is a biased 
tracer of the underlying dark matter distribution. The numerical simulations of the post reionization 21 -cm signal 
[12, 13], show that a constant scale independent bias model is adequate to describe the large scale distribution of the 
21-cm signal at the relevant redshifts. 

The transmitted flux J- through the Ly-a forest may be modeled by assuming that the gas traces the underlying 
dark matter distribution [15] except on small scales where pressure plays an important role. Further, it is believed 
that photo-ionization equilibrium that maintains the neutral fraction, also leads to a power law temperature-density 
relation [16]. On large scales it is reasonable to assume that the fluctuations in the transmitted flux djr = (F/T — l) 
traces the dark matter field, with the implicit assumption that the Ly-a forest spectrum has been smoothed over 
some suitably large length scale. 

With all the assumptions discussed above we may relate fluctuations in Fourier space for both the 21-cm temper- 
ature and Ly-a transmitted flux to the dark matter fluctuations as in equation (1). Note, that this model depends 

(1) C2) fl) (2) 

on 4 parameters (wp , , b-p , b-p ) apart from the large scale dark matter distribution and the underlying cos- 
mological model. For the 21-cm brightness temperature field we have (for the WMAP7 cosmological parameters) 
= 4.0 mK 621 Shi (1 + z) 2 Hq/H(z), where xm is the mean neutral fraction. The neutral hydrogen fraction is 
believed to be a constant with a value xhi = 2.45 x 1CP 2 using n gas ~ 10~ 3 [17]. The quantity 621 (k, z) is studied in 
numerical simulations [12, 13]. We adopt a constant bias 621 = 2 at our fiducial redshift z = 2.5 from these simulation 

results. For the Ly-a forest, we use an approximate bjf ~ —0.13 from the numerical simulations of Ly-a forest[14]. 
We note however that these numbers suffer from large uncertainties owing to our lack of a complete model of the 
IGM. 

A Ly-a survey usually covers a large volume, as compared to a single field of view redshifted 21-cm observation. 
The cross-correlation can however be computed only in the region of overlap between the two fields. 

For the Ly-a forest flux measurements the observed flux fluctuations in Fourier space is given by A^r(k) = p(k) ® 
Ajr(k) + Ajr noise (k), where p is the sampling window function in Fourier space, and Ajr noisG (k) denotes a possible 
noise term. The function takes care of the discrete sampling of the skewers corresponding to the quasar lines of sight. 
In the variance calculation using equation (4) we use Pjr ot {k) as the total power spectrum of Ly-a flux given by [18] 

Pj ot (k) = PjrQt) + P> D (fc,|)Pw + N?, (5) 

where P^(k) = b^ P(k), the quantity Pjr D (fc||) is the usual ID flux power spectrum [15] corresponding to individual 
spectra given by PjP(k^) = {2n)~ 2 J d 2 k±Pjr(k) and Pw denotes the power spectrum of the window function. The 
quantity Nj? denotes the effective noise power spectra for the Ly-a observations. The 'aliasing' term PjP(k\\)Pw 
quantifies the discreteness of the ID Ly-a skewers. If the Ly-a spectra are measured with a sufficiently high SNR, it 
maybe shown that each quasar line of sight may actually be given the same weight and Pw = h, where n is the 2D 
density of quasars (n = Nq/A, where A is the area of the observed field of view) [19]. We assume that the variance 
ajr N of the pixel noise contribution to Sjr is the same across all the quasar spectra whereby we have Nj- — cr^ N /n 
for its noise power spectrum. This is related to the signal to noise ratio on the continuum and depends on the size 
of the pixel. Hence ctjc N = {P)~ 2 [S/N\^ x {Ax/l Mpc). We shall henceforth refer to this noise for S/N in 1A pixels. 
In arriving at equation (5) the effect of quasar clustering [20] has been ignored with the assumption that dominant 
contribution to noise comes from the Poisson term for Quasar surveys with realistic values of n. 
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For the 21-cm observations one may ignore the discrete nature of the DLA sources. We consider a radio- 
interfcrometric measurement of the rcdshiftcd 21-cm signal with the total signal is given by 

pTot = p T ( k ) +A r T . (6) 

The cosmological component of the signal is Pr(k) = (b^ ) 2 P(k) and the noise power spectrum At at an observed 

frequency v = 1420/(1 + z)MHz can be calculated using the relation N^(k,v) = ( -j- J ^ [21]. Here T sys 

denotes the sky dominated system temperature, B is the observation bandwidth, to is the total observation time, r v 
is the comoving distance to the redshift z, I is the comoving length corresponding to the bandwidth B, nj,(U, v) is 
the number density of baseline U , where U = kj_r„/2ir, and A e is the effective collecting area for each individual 
antenna. We may write nb{U, v) = N(N — 1)/2d{U, where N is the total number of antenna in the radio array 
and Jid^U^v) is the normalized baseline distribution function [22, 23]. 

We use a Fisher matrix analysis to constrain /nl from the observables of relevance here, namely the auto and cross 
bispectra B (the index e is dropped for brevity). The Fisher matrix for a set of model parameters pi is given by 



E E E 

ki=k min k 2 =k mi „ k 3 =k„ 



F - V V V 1 dB dB 

2^ 2^ 2^ (AB) 2 d Pl d Pj ' 1 ' 



where k min = max(fc mirl , \ ki — fc 2 |) and the summations are performed using 6k = k min . Assuming that the likelihood 
functions for Pi are Gaussian distributed, the corresponding minimum errors in Pi are limited by the Cramer-Rao 
bound of = F~ , We have modeled the Ly-a and 21-cm signals using a set of bias parameters and the non-Gaussianity 

parameter so that P i = (/nl, ^j^' ^^). We shall use the Fisher matrix to investigate the power of a Ly-a 

survey and 21-cm observation to constrain these parameters. 

RESULTS AND DISCUSSION 

We consider a radio interferometric observation of the redshifted 21-cm signal at a fiducial redshift z = 2.5. The 
choice of redshift is justified by a apriori knowledge of quasar distribution which is known to peak between z = 2 
and z = 3. We consider a futuristic radio observation with a SKA 1 like radio array with 1400 antennas, each offering 
an effective area of 45 m 2 ( fov ~ 7r8.6 2 deg 2 ), spread out with an approximately uniform baseline distribution. The 
observing central frequency is taken to be 406 MHz corresponding to z = 2.5. The system temperature is dominated 
by the sky temperature and is assumed to be of the form T sys = 60 [j^f] + 50 K [24]. The noise estimates 

depend crucially on the volume of the survey We take k m i n = 0.012 Mpc -1 and k max = 0.2 Mpc -1 for a single field 
of view observation and consider a total observing time of 4000 hrs. 

Typically, the Ly-a surveys have a much larger sky coverage in the transverse direction than single field radio 
observations. In the radial direction, the frequency bandwidth of radio telescope dictates the volume of the 21-cm 
survey. For the Ly-a forest observations parts of the spectra contaminated by Ly-/3 or O — VI lines at one end and 
quasar proximity effect on the other end, may not be used and the correlation can me measured in a smaller redshift 
interval. The cross-correlation can however only be measured in the overlapping regions. We have considered a Ly-a 
spectroscopic survey with n = 10~ 3 Mpc~ 2 which is assumed to be constant across the redshift range of interest. 
We note that n shall in general be dependent on the central redshift and magnitude limit of the survey. We also 
assume that the spectra are measured with a S/N = 2 for 1A pixels. We consider a BOSS 2 like survey which gives 
us k m i n = 2x 10~ 3 Mpc" 1 and k max = 1 Mpc" 1 respectively. The cross-bispectrum between the 21-cm signal and 
Ly-a forest may either include two Ly-a component and one 21-cm component or vice- versa. We denote these as 
$ttt and Bttf respectively. Both of these can be estimated from the same data sets. However, the relative merits 
depends on the nature of the observations and quality of the data. 

The cross correlation may be computed in the volume spanned by the 21-cm observation, which is smaller than the 
coverage of the Ly-a survey. Further, the signal with higher resolution (in this case Ly-a forest) has to be smoothed 
to match the poorer resolution of the other field. The bispectra are then obtained for arbitrary triangles assuming 



1 http://www.skatclescope.org 

2 http://cosmology.lbl.gov/BOSS/ 
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FIG. 1: The 68.3%, 95.4% and 99.8% likelihood confidence contours for / NL and various bias parameters. Shown in the figure 
are the values fc m ;n and k max (unit Mpc -1 ) used to compute the Fisher matrix for different estimators mentioned in the plots. 
The bispectrum estimators used for analysis are .FTT, TTT, TTT, TTT (top to bottom). 



that the primordial power spectrum is generated by a quadratic inflaton potential and the background cosmological 
parameters are frozen to their best fit values from WMAP7 year data [25]. 

We first consider £>.ftt- The Fisher matrix analysis is used to obtain bounds on the parameters (/nl,^t > )• 
The Likelihood function is assumed to be a Gaussian, which yields the confidence ellipses in the parameter space 
centered at the fiducial values of the corresponding parameters. Figure I (top panel) shows these results for 2?jftt- 
The tilt of the ellipse measures the correlation between the parameters, and the semi major/minor axes measures the 
maximum uncertainties in measurement of the parameters. The correlation between parameters pi and pj is measured 

using a coefficient r y = F^ 1 / J F^ 1 Ffj 1 . The strong correlation between and bj} (r ~ 0.997) reflects the fact 
that these parameters occur as a product in the cross-powerspectra. We find that it is possible to constrain /nl at a 
level A/nl = 17 for the given observational parameters. When we consider B^j^jr (see Figure (1)) the bounds on / NL 
worsens and we have A/nl = 64. This is because - as compared to B^j^jr the quantity Bjr TT has a greater dependence 
on the 21 cm signal which appears twice in it and the SKA parameters chosen here makes the 21-cm signal less noisy 
as compared to the 2 — tr Ly-a spectra. The situation would be reversed if the Ly-a forest had high SNR and the 
21-cm observation was relatively more noisy. The numerical results are summarized in the table (I). For the cross 
bispectra, the volume of the overlap region dictates the sensitivity. Between the two fields being used to compute the 
bispectrum, the one which appears twice should have higher SNR, to yield better constraints on the parameters. 

For the auto bispectra, the survey volume and the observational noise decides the errors on the parameters being 
estimated. For the 21-cm signal and Ly-a forest auto correlation we choose the parameters (/nLj^t , ) anc ^ 



G 



(11 (2) (2.) (2t) 

(/nl, bjr , bjr ) respectively. We use the fiducial values b^' = b-p = in our Fisher analysis. For the Ly-a forest, 
the SNR for the individual spectra is low. This is compensated by the large survey volume. With the given noise 
parameters, the constraint on /nl from the 21-cm and Ly-a bispectrum are of the same order with bounds A/nl = 9.4 
obtained from Bj^j^jr and A/nl = 6.3 obtained from £>ttt- A strong correlation between the linear and quadratic 
bias parameters is also seen in both the cases (see Figure (1) lower two panels). We note here that this is the first 
prediction of constraints on /nl from 21-cm signal and Ly-a forest using the three dimensional analysis with the 
Bispectrum estimator for arbitrary triangular configurations. 
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TABLE I: The bounds on various parameters and their correlations obtained from Fisher analysis for different bispectrum 
estimators. Note that the parameter p\ denotes / NL in all the estimators. 
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FIG. 2: The bounds on / NL for different estimators as a function of k m in assuming same SNR and observation time. Clearly 
this plot indicates a hierarchy of power to constrain / NL between different estimators. However, this hierarchy is specific to the 
observational parameters and noise levels associated with the two fields. 

As noted earlier, the volume of the survey plays a crucial role in the estimation of the parameters. It fixes the 
largest scales k m i n that one may probe. Figure 2 shows the variation of the error on /nl with k m i n for a given set of 
experimental parameters. It indicates that, for all the bispectra estimators considered, there is a monotonic increase of 
A/nl with k m i n . The constraints on /nl hence, gets worse when k m i n is increased. This is expected, since increasing 
kmin leads to the availability of lesser number of k-modes. 
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In our analysis, the observational parameters have been kept constant throughout. In reality the relative merit or 
demerit of a given estimator will depend on the noise in the data sets. The complete exploration of the full range of 
observational parameters for the two fields is required to be done to judge which estimator yields tighter constraint 
on /nl as compared to the others and for what values of the observational parameters can that be achieved. 

Several observational issues pose severe hindrance towards detecting the cosmological signals. For the 21-cm signal, 
astrophysical foregrounds which are several order higher than the signal completely submerge the cosmological infor- 
mation [26]. Several methods using the distinct spectral behavior of the foregrounds as compared to the signal has 
been proposed to clean the observed maps and retrieve the cosmological signal [27]. In this work we have assumed 
that for the 21-cm field, foreground cleaning has been done. For the Ly-a forest, a host of issues need to be addressed. 
This includes a proper modeling and subtraction of the continuum, tackling of contamination from metal lines in 
the forest [28], effect of Galactic super winds, to mention just a few. Rcdshift space distortion plays a crucial role 
on scales of our interest and is required to be incorporated in our analysis. We intend to take this up along with a 
detailed analysis of observational aspects and instrumental noise in a future work. 

We conclude by noting that the three dimensional distribution of the post-rcionization neutral hydrogen as probed 
using the 21-cm or the Ly-a forest, may be potentially used to estimate various auto/cross bispectra and thereby 
enrich our understanding of the early Universe. 
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